Modeling auditory processing of amplitude modulation. I. Detection and masking with narrow-band carriers.
Abstract
This paper presents a quantitative model for describing data from modulation-detection and modulation-masking experiments, which extends the model of the "effective" signal processing of the auditory system described in Dau et al. [J. Acoust. Soc. Am. 99, 3615-3622 (1996)]. The new element in the present model is a modulation filterbank, which exhibits two domains with different scaling. In the range 0-10 Hz, the modulation filters have a constant bandwidth of 5 Hz. Between 10 Hz and 1000 Hz a logarithmic scaling with a constant Q value of 2 was assumed. To preclude spectral effects in temporal processing, measurements and corresponding simulations were performed with stochastic narrow-band noise carriers at a high center frequency (5 kHz). For conditions in which the modulation rate (fmod) was smaller than half the bandwidth of the carrier (delta f), the model accounts for the low-pass characteristic in the threshold functions [e.g., Viemeister, J. Acoust. Soc. Am. 66, 1364-1380 (1979)]. In conditions with fmod > delta f/2, the model can account for the high-pass characteristic in the threshold function. In a further experiment, a classical masking paradigm for investigating frequency selectivity was adopted and translated to the modulation-frequency domain. Masked thresholds for sinusoidal test modulation in the presence of a competing modulation masker were measured and simulated as a function of the test modulation rate. In all cases, the model describes the experimental data to within a few dB. It is proposed that the typical low-pass characteristic of the temporal modulation transfer function observed with wide-band noise carriers is not due to "sluggishness" in the auditory system, but can instead be understood in terms of the interaction between modulation filters and the inherent fluctuations in the carrier.
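The two scaling domains of the modulation filterbank can be made concrete with a short sketch. The abstract gives only the bandwidth rule (a constant 5 Hz up to 10 Hz, then constant Q = 2 up to 1000 Hz); it does not state the center-frequency spacing. The sketch below therefore assumes, purely for illustration, that centers in the lower range are spaced by one bandwidth and that adjacent constant-Q filters meet at their band edges. The function name `modulation_filterbank` and its parameters are hypothetical and not taken from the paper.

```python
def modulation_filterbank(f_max=1000.0, linear_bw=5.0, transition=10.0, q=2.0):
    """Return a list of (center_hz, bandwidth_hz) pairs.

    Bandwidth rule from the abstract: constant bandwidth below the
    transition frequency, constant Q above it.
    ASSUMPTION (not from the abstract): the center-frequency spacing.
    """
    filters = []

    # Lower domain: constant bandwidth, centers assumed one bandwidth apart.
    fc = 0.0
    while fc < transition:
        filters.append((fc, linear_bw))
        fc += linear_bw

    # Upper domain: bandwidth = fc / Q. Assumed spacing: the upper edge of
    # one filter, fc * (1 + 1/(2Q)), equals the lower edge of the next,
    # fc_next * (1 - 1/(2Q)).
    ratio = (1.0 + 1.0 / (2.0 * q)) / (1.0 - 1.0 / (2.0 * q))
    fc = transition
    while fc <= f_max:
        filters.append((fc, fc / q))
        fc *= ratio

    return filters


if __name__ == "__main__":
    for center, bandwidth in modulation_filterbank():
        print(f"fc = {center:7.2f} Hz, bandwidth = {bandwidth:7.2f} Hz")
```

Under these assumptions the two domains join smoothly: at 10 Hz a Q of 2 gives a 5 Hz bandwidth, the same as in the constant-bandwidth range. A different spacing rule would change the number and position of the filters but not the bandwidth rule itself.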
Similar resources
A computational model of human auditory signal processing and perception.
A model of computational auditory signal-processing and perception that accounts for various aspects of simultaneous and nonsimultaneous masking in human listeners is presented. The model is based on the modulation filterbank model described by Dau et al. [J. Acoust. Soc. Am. 102, 2892 (1997)] but includes major changes at the peripheral and more central stages of processing. The model contains...
Auditory perception of amplitude modulated sinusoid using a pure tone and band-limited noises as modulation signals
Frequency selectivity in amplitude-modulated sound has been reported in terms of the modulation threshold level for amplitude-modulation detection, where a pink or white noise carrier was modulated with a sinusoid and band-limited white noise, which showed similar band-pass type masking characteristics. Such previous studies treated band-limited white noise carriers. In this paper, an amplitude-mo...
Modeling auditory processing of amplitude modulation
In this thesis a new modeling approach is developed which is able to predict human performance in a variety of experimental conditions related to modulation detection and modulation masking. Envelope fluctuations are analyzed with a modulation filterbank. The parameters of the filterbank were adjusted to allow the model to account for modulation detection and modulation masking data with narrowband c...
Modeling auditory processing of amplitude modulation. II. Spectral and temporal integration.
A multi-channel model, describing the effects of spectral and temporal integration in amplitude-modulation detection for a stochastic noise carrier, is proposed and validated. The model is based on the modulation filterbank concept which was established in the accompanying paper [Dau et al., J. Acoust. Soc. Am. 102, 2892-2905 (1997)] for modulation perception in narrow-band conditions (single-c...
Modeling Auditory Perception for Robust Speech Recognition
Forward masking stimuli: (A) Large timescale view of a single 2AFC trial; (B) Fourier Transform of the probe signal (128 ms rectangular window); (C) Smaller timescale view of the probe following the masker by 15 ms. Average forward masking data (circles), and std. dev. (error bars), together with the model fit (lines) as a function of masker level across 5 octaves, with probe delays of 15, 30,...
Journal title:
- The Journal of the Acoustical Society of America
Volume 102, Issue 5 Pt 1
Pages -
Publication date 1997